Sparse Latent Space Policy Search

نویسندگان

Kevin Sebastian Luck

Joni Pajarinen

Erik Berger

Ville Kyrki

Heni Ben Amor

چکیده

Computational agents often need to learn policies that involve many control variables, e.g., a robot needs to control several joints simultaneously. Learning a policy with a high number of parameters, however, usually requires a large number of training samples. We introduce a reinforcement learning method for sampleefficient policy search that exploits correlations between control variables. Such correlations are particularly frequent in motor skill learning tasks. The introduced method uses Variational Inference to estimate policy parameters, while at the same time uncovering a lowdimensional latent space of controls. Prior knowledge about the task and the structure of the learning agent can be provided by specifying groups of potentially correlated parameters. This information is then used to impose sparsity constraints on the mapping between the high-dimensional space of controls and a lowerdimensional latent space. In experiments with a simulated bi-manual manipulator, the new approach effectively identifies synergies between joints, performs efficient low-dimensional policy search, and outperforms state-of-the-art policy search methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Introducing Triple Play for Improved Resource Retrieval in Collaborative Tagging Systems

Collaborative tagging systems (like Flickr, del.icio.us, citeulike, etc.) are becoming more popular with passage of time. Users share their resources on tagging systems, and add keywords (called tags) to these resources. Users can search resources using these tags. But as the user gives more tags for search, he might not get sufficient search results, because the resources might not be tagged w...

متن کامل

Bayesian Learning in Sparse Graphical Factor Models via Variational Mean-Field Annealing

We describe a class of sparse latent factor models, called graphical factor models (GFMs), and relevant sparse learning algorithms for posterior mode estimation. Linear, Gaussian GFMs have sparse, orthogonal factor loadings matrices, that, in addition to sparsity of the implied covariance matrices, also induce conditional independence structures via zeros in the implied precision matrices. We d...

متن کامل

Bayesian Learning in Sparse Graphical Factor Models via Annealed Entropy

متن کامل

People Tracking with the Laplacian Eigenmaps Latent Variable Model

Reliably recovering 3D human pose from monocular video requires models that bias the estimates towards typical human poses and motions. We construct priors for people tracking using the Laplacian Eigenmaps Latent Variable Model (LELVM). LELVM is a recently introduced probabilistic dimensionality reduction model that combines the advantages of latent variable models—a multimodal probability dens...

متن کامل

Sparse Latent Semantic Analysis

Latent semantic analysis (LSA), as one of the most popular unsupervised dimension reduction tools, has a wide range of applications in text mining and information retrieval. The key idea of LSA is to learn a projection matrix that maps the high dimensional vector space representations of documents to a lower dimensional latent space, i.e. so called latent topic space. In this paper, we propose ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Sparse Latent Space Policy Search

نویسندگان

چکیده

منابع مشابه

Introducing Triple Play for Improved Resource Retrieval in Collaborative Tagging Systems

Bayesian Learning in Sparse Graphical Factor Models via Variational Mean-Field Annealing

Bayesian Learning in Sparse Graphical Factor Models via Annealed Entropy

People Tracking with the Laplacian Eigenmaps Latent Variable Model

Sparse Latent Semantic Analysis

عنوان ژورنال:

اشتراک گذاری